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Abstract — Molecular communication is an expanding body of 
research. Recent advances in biology have encouraged using 
genetically engineered bacteria as the main component in the 
molecular communication. This has stimulated a new line of 
research that attempts to study molecular communication among 
bacteria from an information-theoretic point of view. Due to high 
randomness in the individual behavior of the bacterium, reli- 
able communication between two bacteria is almost impossible. 
Therefore, we recently proposed that a population of bacteria in a 
cluster is considered as a node capable of molecular transmission 
and reception. This proposition enables us to form a reliable 
node out of many unreliable bacteria. The bacteria inside a node 
sense the environment and respond accordingly. In this paper, 
we study the communication between two nodes, one acting as 
the transmitter and the other as the receiver. We consider the 
case in which the information is encoded in the concentration 
of molecules by the transmitter. The molecules produced by the 
bacteria in the transmitter node propagate in the environment 
via the diffusion process. Then, their concentration sensed by 
the bacteria in the receiver node would decode the information. 
The randomness in the communication is caused by both the 
error in the molecular production at the transmitter and the 
reception of molecules at the receiver. We study the theoretical 
limits of the information transfer rate in such a setup versus 
the number of bacteria per node. Finally, we consider M-ary 
modulation schemes and study the achievable rates and their 
error probabilities. 

I. Introduction 

The use of bacteria as means of communication is inspired 
from naturally occurring communication between bacteria 
through a process called Quorum Sensing (QS). Molecular 
communication between bacteria is conducted in such a way 
that a population of bacteria can reliably infer information 
about their environment (TJ. Bacteria use molecules to ex- 
change information among themselves to be able to perform 
a task otherwise impossible (TJ, (2). Some examples for this 
coordinated task are light production and attacking the host 
by bacteria. In QS, each individual bacterium in a population 
releases specific types of molecules to the environment. The 
concentration of molecules in the environment (sensed by 
the same population of bacteria) is a measure of the local 
density of bacteria. Bacteria performs their task when the 
concentration of molecules surpasses a threshold. 

New applications and designs are constantly emerging from 
manipulation of the genetic content of QS bacteria. In (3), 
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a simple genetic circuit is used with QS in order to design 
logical gates, i.e., the output of bacterium coincides with a 
logical table according to presence or absence of specific 
molecules in its vicinity. QS is used in (4) to design bio- 
logical clocks, i.e., regulation of the output of a population 
of bacteria to alternate periodically. There has been also new 
research in network engineering inspired by this phenomenon. 
For example, models for forming a network via molecular 
communication are given in (5), (6). In these studies the 
information is encoded in the concentration of molecules. 
This information model departs from another line of research 
which relies on encoding the information in the timing of 
emission of molecules (7), (8). All these studies have inspired 
researchers to investigate the communication among bacteria 
more carefully and also pay attention to information-theoretic 
aspects of bacteria communication (9), fTO) . 

The communication between bacteria is slow and the num- 
ber of bits transferred is small. On the other hand, the relia- 
bility of the communication due to large number of bacteria 
in the environment can be significant. The main motivation is 
to enable reliable communication in the networks that are bio- 
compatible as well. These networks have sensory applications 
and the delay in the communication can be fairly large. 

A. Problem Setup 

As shown by the previous studies in biology (TT), the 
individual behavior of bacteria has a heterogeneous nature 
and may involve high levels of randomness. In order to form 
reliable communication out of unreliable bacteria, we consider 
the communication between a population of bacteria residing 
in a node. In (5), we introduced a Molecular Communication 
Networking (MCN) paradigm where populations of bacteria 
(i.e., the primitive agents), clustered together and acted har- 
monically, form a node in a communication network. Each 
bacterium in a node is able to produce molecules, sense the 
concentration of molecules (from a chemical substance) in the 
environment and respond accordingly. This particular response 
is programmed into the plasmid which is embedded in the 
bacteria to act along with the DNA of bacteria. Such a node 
in MCN is considered to be an independent entity and act as a 
fairly smart node in the network. In such proposed networks, 
the communication happens between the nodes instead of 
the individual bacteria. The sensed information is relayed 
in the network from one node to another through diffusion 
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Fig. 1. The molecular communication setup consisting of the transmitter, 
channel and the receiver 



of molecules in the medium. This setup enables us to take 
advantage of the primitive agents (i.e., the engineered bacteria) 
in a network that is designed to perform a specific task and 
transfer information. The information traveled in the network 
is a specif parameter of the environment, (e.g., existence of a 
chemical substance and/or its amount in the environment). 

Our goal is to model and analyze the molecular commu- 
nication between two nodes in a network described above. 
Toward this, in this paper, we modify our problem to a two- 
node communication in which one node acts as the transmitter 
and the other as the receiver (depicted in Fig.[T] As in (5), the 
transmitter node is assumed to be smart enough to stimulate its 
bacteria to emit molecules into the environment. The bacteria 
are stimulated with type I molecules with a proper concentra- 
tion. These molecules are trapped by the ligand receptors of 
the transmitter bacteria. Upon this, each bacterium produces 
type II molecules with concentration rate that depends on the 
number of its activated receptors. We consider a probabilistic 
model for the reception of type I molecules which in turn 
results in a probabilistic model for the type II output rate. The 
produced type II molecules by the transmitter travel through 
the diffusion channel and reach the receiver node. 

The process of reception of type II molecules by the 
bacteria of the receiver node is similar to that of type I in 
the transmitter node. The receptors of the bacteria in the 
receiver node are designed to trap the type II molecules. The 
difference is that the final output by each bacterium due to 
reception of type II molecules will be in the form of light or 
Green Fluorescent Protein (GFP). The receiver node infers the 
transmitted information from the aggregate output of all the 
bacteria within the node. We assume the diffusion channel to 
be noise free; hence, the stochastic nature of the output is due 
to two factors: the error in the channel input concentration 
(i.e., the transmission noise) and the error in the reception 
of molecules at the receiver. Here, we intend to study the 
maximum rate of information exchange and also analyze M- 
ary signaling schemes with their rates of error. In (12), we 
described the reception of molecules by the bacteria. In this 
paper, we extend that model to the transmitter and study the 



role it plays alongside of the receiver. This further enables us 
to study the capacity of the molecular communication between 
two nodes. 

The rest of the paper is organized as follows. In Sec. [n| the 
process of production of molecules at the transmitter side is 
discussed. Sec. Ull studies both the receiver and the achievable 



rates. Then, Sec. IV introduces a practical signaling scheme 
for the communication setup. Finally, Sec. |V| concludes the 
paper. 

II. Transmitter Model 

Let assume the transmitter node T would like to send a 
concentration Aq to the receiver node R. The chamber of 
the transmitter stimulates the n bacteria (it contains at the 
node) with type I molecules which in turn produce type 
II molecules by the bacteria. These molecules would then 
diffuse through the channel to the receiver. To perform the 
transmission functionality, each bacterium must be able to 
receive and decode the type I molecules, emitted by the 
chamber as stimulus to the node. Each bacterium is assumed 
to have separate receptors for different types of molecules. 
We assume N ligand receptors for each type of molecules. 
Furthermore, the model of the two type receptors is assumed 
to be the same, i.e. the process of reception follows the same 
set of equations. To generate the desired type II concentration 
Aq, the agents are stimulated with type I molecules with the 
appropriate concentration A\ which will be determined later 
in this paper. 

In order to account for the production and reception of 
molecules, we use the model introduced in (13) . This model 
considers a chain of linear differential equations that account 
for the output of bacteria in response to presence of molecules 
in the medium. In this model, each cell receptor (i.e., the ligand 
receptor) is activated with a probability that depends on the 
concentration of molecules in the medium surrounding the cell. 
As shown in (13), the binding probability p at the steady state 
is given by 

Aj 
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Aj + k 

where A is the concentration observed by the bacterium, 
7 is the input gain and n is the dissociation rate of the 
trapped molecules from the cell receptors. The process of 
the production of complex molecules, transcription of genes 
and the process of the production of the output are modeled 
similarly (13). The output of bacteria in the steady state is a 
linear function of the number of the activated receptors. Hence, 
the output noise of the bacterium is caused by the probabilistic 
nature of the ligand reception process (TO) . 

We assume the noise in the transmitter output is originated 
from the discrepancy in the individual behavior of the bacteria 
in the transmitter node T. In other words, even though the 
average behavior of bacteria can be formulated with a set 
of deterministic differential equations, the individual behavior 
of bacteria features randomness. Such randomness can be 
accounted for by considering the constants in ([T} as random 
variables. Two factors contribute to the uncertainty of the 



molecular concentration output of a transmitter node. One is 
the probabilistic nature of the number of activated receptors. 
We model this by assuming each receptor being active as a 
Bernoulli random variable that is 1 with probability p defined 
in ([I]). The other factor is the randomness in p itself from 
one bacterium to another within the node. This is due to the 
variability of the input gain 7 in ([T]) within the population of 
bacteria. We model this variation in the input gain 7 as an iid 
additive noise e 7 . Hence, the entrapment probability p\ upon 
the reception of the concentration A\ by bacteria would be 
given by 

Mi + e 7 ) 



pi 



Al(7 + e 7 ) + «' 



where e 7 is a zero-mean Normal noise with variances a 2 . The 
variance is assumed to be sufficiently small such that we can 
ignore the second and higher orders of ( — ). We assume the 
same p\ for all the receptors of a bacterium, but it varies 
according to e 7 in ([2]) for different bacteria in a node. 

The exact analysis using the expression in ^ would be 
cumbersome. Hence, we only consider the first order terms of 
We define the noiseless input probability as 
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By approximating ([2]), we will have 

Pi(l-Pi) 



Pi =Pi 
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The total number of activated receptors of i th bacterium, Xi, 
is a Binomial random variable with parameters (iV,pi^) where 
Pij is the realization of p\ for the i th bacterium. Recall that 
N is the number of ligand receptors per bacterium for a given 
molecule type. We denote X as the total number of activated 
receptors of all bacteria in the node T. Hence, X = J27=i ^i- 
Using the conditional expectation, we have 

E(Xi) = E(E(Xi\p lti )) = E(N Plii ) = Npl 

where the last equality is due to the fact that the noise e 7 
has zero mean. Hence, we have E(X)=niVp*. By using the 
conditional variance, we have 

Var(X z ) = E(Var(X z \p ljZ )) + Var(E(X z \p ljZ )) 
= E(Np lii (l-p 1 J) + Var(Np 1 J 

2 

= Npl(l - pi) + (iV 2 - N)pf(l -Pt) 2 ^. (5) 

The first term in ([3]) is due to the general uncertainty in a 
Binomial output (i.e., the probabilistic nature of the ligand 
reception) and the second term is due to the noise in the 
parameter p\ . By independent assumption between the outputs 
of different bacteria, the variance of the total output by the 
transmitter node is obtained as 

2 

Var(X) = nNpl(l-pl)+n(N 2 -N)pf(l-pl) 2< ^. (6) 



Since the number of receptors N per bacterium is usually 
large enough, the second term is dominating. Hence, we can 

approximate the variance by nN 2 p\ 2 (l —p\) 2< ^z- 

As discussed above, the production output of bacteria de- 
pends linearly on the number of activated receptors X. Hence 
the total type II molecule output of the node T is equal 
to aX where a is a constant. The produced molecules are 
transferred through the diffusion channel. Hence, the steady- 
state concentration A 2 at R will be 



A 2 = G(r)aX, 



(2) where as shown in fl4|, G(r 



(7) 

, V> for the ideal channel 
model. Here r is the distance between the transmitter and the 
receiver nodes and D is the diffusion coefficient. Moreover, 
from ([7]), we obtain 

A17 



E{A 2 ) = aG(r)nN- 



(8) 



A17 + k 

The required stimulating concentration A\, can be obtained 
by putting the right term in ^ equal to Aq\ the desired 
concentration to be transferred from node T to R. Hence, we 
have 

= ^0 
1 ~/(aG(r)nN- A )' 

In order to make the analysis of the receiver tractable, we 
approximate the concentration in ([7]) with a Normal random 
variable. Since the number of receptors N is large, we 
can use the Central Limit Theorem to approximate Xi by 
J\f(Np*,Var(Xi)) where Var(Xi) is given in Hence, 
the transmitter output X (without including a) would be the 
sum of n Normal variables given by 



X = nNp\+e X i 



(9) 



where ex has a A/"(0, Var (X)) distribution. Hence, the con- 
centration A 2 at the receiver would be 



A 2 = A + e t . 



(10) 



where e t is a zero-mean Normal random variable with variance 
a 2 = G 2 (r)a 2 Var(X). The first term in (10) can be viewed 
as the signal to be decoded by the receiver node and the 
second term is an additive Gaussian noise which has a signal- 
dependent variance. In other words, the transmitter induces 
a concentration A 2 of type II molecules at the receiver node 
which has the desired concentration Aq plus the noise We refer 
to this noise as the transmitter noise perceived at the receiver 
in the molecular communication. 

III. The Receiver and Capacity Analysis 



The concentration A 2 derived in (10) is sensed by the 
bacteria in the receiver node R. The sensing process of type II 
molecules is similar to that of the type I molecules we analyzed 
in the transmitter. Hence it follows the equations in the 
previous section. The difference is that the input concentration 
is noisy itself which introduces additional uncertainty to the 
output of the node R\ which is in the form of light or GFR 



This output is the indication of the decoded information sent 
to R. 

Here, we incorporate the effect of both noises introduced 
in the last section. Again, the noise e 7 accounts for the 
dependency of gain 7 on the bacterium at the receiver node 
and e t accounts for the concentration noise introduced in fLO} . 
Hence, the entrapment probability at the receiver can be 
written as 



P2 



(A + 6 t )( 7 + e 7 ) 
(A + e t )(7 + e 7 ) + «' 



(11) 



Note that the input concentration noise e t affects all the 
bacteria in the same manner. By approximating ( [TT] ) and again 
keeping only the first order terms of the noises, we obtain 

Po(l ~Po) 



P2=P0 
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where we define po = a °+k * ^ ne ^ rst term m tne right hand 



side of ( 12) is due to the actual channel input, the second term 
is the noise due to the reception process at the receiver (i.e., 
the gain 7 varies among the different bacteria in the node R) 
and the third term is contributed by the (transmitter) noise in 
the receiver input concentration A2. We denote by Y{ as the 
output of i th bacterium in the node R. Then, Y = J27=i ^ 
would give the aggregate output of all the n bacteria in the 
node. For the rest of the discussion, we assume that the output 
of the node R is in the form of light [2]. Note that Y is the 
sum of binomial random variables with parameters (A 7 ", i^,*)- 
Here, p2 : i is the realization of P2 for the i th bacterium. 

The expected value of the output light can be obtained 
similar to the transmitter, that is E(Y) = Nnpo. Computing 
the variance of the output will be more involved. Since e t is 
the same for all the bacteria of a node, Y^'s are independent 
given the value of e t . Hence, 

n n 



= nN 2 ^2(p +po(l-po)etY(l - { Po +p (l - Po)e*)) • 

7 

where the last equality is resulted by using po + Po(l — Po) e t 
as po in ^ anu neglecting N relative to ./V 2 . By neglecting 
higher order terms of e t , we obtain 



E(Var(Y\e t ))=nN 2 i4(l-poy 2 . 
On the other hand, we have 

n n 

Var(E(Y, Yi\e t ) = Var ^ N(p + p (l - p )e t ) 



(13) 



i=l 



N*nVo(l-Po) 2 *l 



(14) 



From (13) and (14) and using the conditional variance, we 
obtain 



Var(Y) = nN\^+nG 2 t Yp*(l-p Q y. (15) 

T 
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Fig. 2. Capacity (bits per sample) versus maximum trapping probability 
Pmax for different numbers of bacteria in a node. 



Note that (variance of e t ) is related to through (6). 
With the same argument as in the transmitter case, we may 
approximate Y with a Normal random variable. Hence, the 
output of the node R would be in the form 



Y = nNp + ey, 



(16) 



where ey is a zero mean normal distribution with variance 
Var(Y) given in ( p3| . Note that, the first term in (16) is the 
noiseless signal and the second one is a Normal additive noise. 
In order to calculate the capacity per channel use from T to 
R, we should obtain the optimized distribution of po which 
maximizes I(po] Y) \ the mutual information between the input 
and the output. This, in turn, gives the optimized distribution 
for Aq through ([T]). 

To proceed, we observe that in practice, Aq cannot take 
any value. Hence, we assume the maximum achievable con- 
centration is equal to A^ax- This corresponds to probability 



p 



74 +/c via jlj. This maximum probability is due 
to the maximum power used by the transmitte. By using more 
power, the transmitter can increase the maximum concentra- 
tion of molecules at the vicinity of the receiver node and 
increase p m ax • Therefore, we obtain the optimized distribution 
for po over the interval [0 Pmax] and calculate the capacity 
based on p m ax- 

The structure of the noise in ( [T6] ) is complicated since the 
noise power depends on the signal itself. Hence, we resort to 
use the numerical method of Blahut-Arimoto algorithm (BA) 
to obtain the optimal distribution for po and its corresponding 
capacity. Equation ( [T5] ) implies that the noise power is at its 
maximum at po = ^and goes to zero when po approaches 
to either zero or one. Hence, we expect that the distribution 
of po should take values closer to and Pmax with a higher 
probability. The results from the algorithm confirms this fact 
and the distribution has local maximums at and Pmax- 
Results for the capacity (in bits per sample) with respect to 
Pmax fo_r_ different numbers of bacteria in the nodes is shown 
in 
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In this setup, we assume N = 50, 



.1. As 

we observe from the plot, the capacity increases when we 
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increase Pmax or the number of bacteria n. Moreover, the 
convexity of the plots change at Pmax = \> The reason for 
it is that after at this point, the variance of the noise starts 
to decrease. Note that the maximum achievable capacity is 
limited even if the transmitter used infinite power to make 
Pmax = 1. In practice, N and n are very large. However 
due to the exponential growth of the simulation time, we only 
computed the capacity for small values of N and n. 
IV. M-ary Modulation 

The analysis in the previous section was based on the 
assumption that any continuous values of the concentration 
less than A max can be produced and received by the nodes. 
In practice, we may use only a finite discrete number of levels 
of molecular concentrations. In this section, we consider M-ary 
modulation and study the the information exchange rate and 
the corresponding achieved error rate. The range of the input 
is determined by p m ax- Two factors influence the signaling 
performance: the number of levels of concentration and the 
choices for the values of those levels. We consider the scenario 
in which m symbols to be chosen with uniform spacing from 
the interval [0 Pmax] - The i th symbol level would correspond 
to Pmaxi^i •> < i < m — 1. We show by p Cji the probability 
of error in the detection of i th symbol. Hence, the total 
probability of error is equal to p e = Y1T=q^ w iPe,i> where the 
weights Wi associated with the m symbols must be obtained . 

We assume the error to occur when the detected symbol 
passes the half way from the previous or the next symbol. 



As observed in ( 15 ), the variance of the noise, and hence p e 



depends on the chosen symbol i. Therefore, we have 



p Gii = 1 - Pr(- 



~Pn 



< 6 Y i < 



Pn 



(17) 



K 2(m-l) ~ 11 ~ 2(m-l) y ' 

where eyi comes from a A/"(0, erf) where of can be computed 
by replacing p with ^r[Pmax in {i^} - As discussed in the 
previous section, variance of the noise is the smallest when 
the input is closets to or 1. Hence, it is intuitive to choose 
larger weights for the inputs closer to these two points. In 
our scheme, we use the weights from the optimal distribution 
calculated by the Blahut-Arimoto algorithm. In Fig. [3] we have 
shown the rate of information for different M-ary modulations 
versus the power of the transmitter. In this setup, again we 
have chosen N = 50 and o\ = .1. In addition, the number 
of bacteria in a node is chosen to be n = 100. As shown by 
the plot, reliable communication (i.e., p e = 10 -6 ) is feasible 
for M = 2, 4, 8, 16 and the required power is shown as well. 
For larger number of symbols, reliable communication is not 
possible as for the case of M = 32, the least error rate (by 
maximizing the Pmax) would be 10 -2 . There, smaller error 
rates can be achieved by increasing either n or N (or both). 

V. Conclusion 

In this paper, we studied the molecular communication 
between two nodes that contain populations of engineered bac- 
teria. The error in the molecular production by the transmitter 
and the probabilistic nature of the reception of molecules at 
the receiver contribute to the noise in the communication. 
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Fig. 3. The information rate versus the maximum power of the transmitter 
for different M-ary schemes 

We studied the theoretical limits of the information transfer 
rate for different number of bacteria per node and different 
power levels. We observed that capacity increases with the 
number of bacteria in the nodes. Finally, we analyzed the rates 
and reliabilities in M-ary modulation. We observed that for a 
fixed number of bacteria per node and the number of ligand 
receptors, reliable communication is not possible for large M, 
even with increasing the input power. 
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